AIbase
Home
AI Tools
AI Models
MCP
AI NEWS
EN
Model Selection
Tags
Multimodal adaptation

# Multimodal adaptation

Webssl Dino7b Full8b 378
A 7-billion-parameter vision Transformer model trained on 8 billion language-unlabeled web images, achieving exceptional visual representation capabilities through self-supervised learning
Image Classification Transformers
W
facebook
68
0
Tiny Random Phi 4 Multimodal
This is a tiny model for debugging, randomly initialized based on the adjusted configuration, specifically designed for rapid process verification.
Image-to-Text Transformers
T
katuni4ka
41.78k
0
Aimv2 1b Patch14 224.apple Pt
AIM-v2 is an image encoder model based on the timm library, with a scale of 1 billion parameters, suitable for image feature extraction tasks.
Image Classification Transformers
A
timm
198
0
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
English简体中文繁體中文にほんご
© 2025AIbase